A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data
Internal identifier: 001665 (Main/Exploration); previous: 001664; next: 001666
Authors: Jean-Charles Lamirel [France]; Pascal Cuxac [France]; Aneesh Sreevallabh Chivukula [India]; Kafil Hajlaoui [France]
Source:
- Lecture Notes in Computer Science [ISSN 0302-9743]
Abstract
Feature maximization is a cluster quality metric which favors clusters with maximum feature representation with regard to their associated data. In this paper we go one step further, showing that a straightforward adaptation of such a metric can provide a highly efficient feature selection and feature contrasting model in the context of supervised classification. In particular, we show that this technique can enhance the performance of classification methods whilst very significantly outperforming (+80%) the state-of-the-art feature selection techniques in the case of the classification of unbalanced, highly multidimensional and noisy textual data, with a high degree of similarity between the classes.
Url: https://api.istex.fr/ark:/67375/HCB-ZKP9VB3P-8/fulltext.pdf
DOI: 10.1007/978-3-642-40319-4_32
Affiliations:
- France, India
- Grand Est, Lorraine (région)
- Nancy, Vandœuvre-lès-Nancy
- Centre national de la recherche scientifique, Laboratoire lorrain de recherche en informatique et ses applications, Synalp (Loria), Université de Lorraine
Links to previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001591
- to stream Istex, to step Curation: 001572
- to stream Istex, to step Checkpoint: 000291
- to stream Main, to step Merge: 001677
- to stream Main, to step Curation: 001665
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data</title>
<author><name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
<affiliation><country>France</country>
<placeName><settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
</author>
<author><name sortKey="Cuxac, Pascal" sort="Cuxac, Pascal" uniqKey="Cuxac P" first="Pascal" last="Cuxac">Pascal Cuxac</name>
</author>
<author><name sortKey="Chivukula, Aneesh Sreevallabh" sort="Chivukula, Aneesh Sreevallabh" uniqKey="Chivukula A" first="Aneesh Sreevallabh" last="Chivukula">Aneesh Sreevallabh Chivukula</name>
</author>
<author><name sortKey="Hajlaoui, Kafil" sort="Hajlaoui, Kafil" uniqKey="Hajlaoui K" first="Kafil" last="Hajlaoui">Kafil Hajlaoui</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5E5E321E04152FC0E1A70514A3E8C0A3194602FD</idno>
<date when="2013" year="2013">2013</date>
<idno type="doi">10.1007/978-3-642-40319-4_32</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-ZKP9VB3P-8/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001591</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">001591</idno>
<idno type="wicri:Area/Istex/Curation">001572</idno>
<idno type="wicri:Area/Istex/Checkpoint">000291</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000291</idno>
<idno type="wicri:doubleKey">0302-9743:2013:Lamirel J:a:new:feature</idno>
<idno type="wicri:Area/Main/Merge">001677</idno>
<idno type="wicri:Area/Main/Curation">001665</idno>
<idno type="wicri:Area/Main/Exploration">001665</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data</title>
<author><name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>SYNALP Team - LORIA, INRIA Nancy-Grand Est, Vandoeuvre-les-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
<placeName><settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
<placeName><settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
</author>
<author><name sortKey="Cuxac, Pascal" sort="Cuxac, Pascal" uniqKey="Cuxac P" first="Pascal" last="Cuxac">Pascal Cuxac</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>INIST-CNRS, Vandoeuvre-les-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Chivukula, Aneesh Sreevallabh" sort="Chivukula, Aneesh Sreevallabh" uniqKey="Chivukula A" first="Aneesh Sreevallabh" last="Chivukula">Aneesh Sreevallabh Chivukula</name>
<affiliation wicri:level="1"><country xml:lang="fr">Inde</country>
<wicri:regionArea>Center for Data Engineering, International Institute of Information Technology, Gachibowli, Hyderabad, Andhra Pradesh</wicri:regionArea>
<wicri:noRegion>Andhra Pradesh</wicri:noRegion>
</affiliation>
<affiliation></affiliation>
</author>
<author><name sortKey="Hajlaoui, Kafil" sort="Hajlaoui, Kafil" uniqKey="Hajlaoui K" first="Kafil" last="Hajlaoui">Kafil Hajlaoui</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>INIST-CNRS, Vandoeuvre-les-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Feature maximization is a cluster quality metric which favors clusters with maximum feature representation as regard to their associated data. In this paper we go one step further showing that a straightforward adaptation of such metric can provide a highly efficient feature selection and feature contrasting model in the context of supervised classification. We more especially show that this technique can enhance the performance of classification methods whilst very significantly outperforming (+80%) the state-of-the art feature selection techniques in the case of the classification of unbalanced, highly multidimensional and noisy textual data, with a high degree of similarity between the classes.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
<li>Inde</li>
</country>
<region><li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement><li>Nancy</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
<orgName><li>Centre national de la recherche scientifique</li>
<li>Laboratoire lorrain de recherche en informatique et ses applications</li>
<li>Synalp (Loria)</li>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree><country name="France"><region name="Grand Est"><name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</region>
<name sortKey="Cuxac, Pascal" sort="Cuxac, Pascal" uniqKey="Cuxac P" first="Pascal" last="Cuxac">Pascal Cuxac</name>
<name sortKey="Cuxac, Pascal" sort="Cuxac, Pascal" uniqKey="Cuxac P" first="Pascal" last="Cuxac">Pascal Cuxac</name>
<name sortKey="Hajlaoui, Kafil" sort="Hajlaoui, Kafil" uniqKey="Hajlaoui K" first="Kafil" last="Hajlaoui">Kafil Hajlaoui</name>
<name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</country>
<country name="Inde"><noRegion><name sortKey="Chivukula, Aneesh Sreevallabh" sort="Chivukula, Aneesh Sreevallabh" uniqKey="Chivukula A" first="Aneesh Sreevallabh" last="Chivukula">Aneesh Sreevallabh Chivukula</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001665 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001665 | SxmlIndent | more
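The record returned by the commands above can also be processed outside Dilib. What follows is a minimal sketch in Python, assuming the output of HfdSelect has been captured as a string; it relies only on the standard library module `xml.etree.ElementTree`. The record uses the `wicri:` prefix without declaring it, so the sketch binds that prefix to a placeholder URI (`urn:example:wicri`, an assumption and not an official namespace) before parsing. The names `parse_record` and `summarize`, and the abridged `SAMPLE` record, are illustrative only.

```python
import xml.etree.ElementTree as ET

# Placeholder URI (assumption): the record never declares the wicri: prefix,
# so a strict XML parser would reject it as an unbound prefix.
WICRI_NS = "urn:example:wicri"


def parse_record(xml_text: str) -> ET.Element:
    """Parse a Dilib <record>, binding the undeclared wicri: prefix first."""
    patched = xml_text.replace(
        "<record>", f'<record xmlns:wicri="{WICRI_NS}">', 1
    )
    return ET.fromstring(patched)


def summarize(record: ET.Element) -> dict:
    """Collect the title, DOI and author names of the described article."""
    analytic = record.find(".//analytic")
    return {
        "title": analytic.findtext("title"),
        "doi": record.findtext(".//idno[@type='doi']"),
        "authors": [name.text for name in analytic.findall("author/name")],
    }


# Abridged copy of the record shown on this page (most elements omitted).
SAMPLE = """<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc>
<publicationStmt><idno type="doi">10.1007/978-3-642-40319-4_32</idno></publicationStmt>
<sourceDesc><biblStruct><analytic>
<title level="a" type="main" xml:lang="en">A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data</title>
<author><name>Jean-Charles Lamirel</name></author>
<author><name>Pascal Cuxac</name></author>
</analytic></biblStruct></sourceDesc>
</fileDesc></teiHeader></TEI></record>"""

if __name__ == "__main__":
    print(summarize(parse_record(SAMPLE)))
```

Binding the prefix by string substitution keeps the sketch short; a stricter pipeline might instead declare the namespace when the record is exported.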
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:5E5E321E04152FC0E1A70514A3E8C0A3194602FD |texte= A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data }}
This area was generated with Dilib version V0.6.33.